NSF PAR Search | NSF Public Access Repository

Kingdom-wide CRISPR guide design with ALLEGRO

https://doi.org/10.1093/nar/gkaf783

Mohseni, Amirsadra; Nia, Reyhane Ghorbani; Tafrishi, Aida; López, Mario León; Liu, Xin-Zhan; Stajich, Jason E.; Lonardi, Stefano; Wheeldon, Ian (August 2025, Nucleic Acids Research)

Abstract Designing CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) single guide RNA (sgRNA) libraries targeting entire kingdoms of life will significantly advance genetic research in diverse and underexplored taxa. Current sgRNA design tools are often species-specific and fail to scale to large, phylogenetically diverse datasets, limiting their applicability to comparative genomics, evolutionary studies, and biotechnology. Here, we introduce ALLEGRO, a combinatorial optimization algorithm designed to compose minimal, yet highly effective sgRNA libraries targeting thousands of species at the same time. Leveraging integer linear programming, ALLEGRO identified compact sgRNA sets simultaneously targeting multiple genes of interest for over 2000 species across the fungal kingdom. We experimentally validated sgRNAs designed by ALLEGRO in Kluyveromyces marxianus, Komagataella phaffii, Yarrowia lipolytica, and Saccharomyces cerevisiae, confirming successful genome edits. Additionally, we employed a generalized Cas9–ribonucleoprotein delivery system to apply ALLEGRO’s sgRNA libraries to untested fungal genomes, such as Rhodotorula araucariae. Our experimental findings, together with cross-validation, demonstrate that ALLEGRO facilitates efficient CRISPR genome editing, enabling the development of universal sgRNA libraries applicable to entire taxonomic groups.
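The abstract above describes choosing a minimal sgRNA set that still targets every species. One way to picture that kind of problem is as a minimum set cover: pick binary variables x_g (guide g is in the library or not), minimize the sum of x_g, and require that every species is hit by at least one chosen guide. The sketch below is not ALLEGRO's implementation; it swaps the integer linear programming solver for a brute-force search that is only practical on toy inputs, and the function name, guide names, and species names are all hypothetical.

```python
from itertools import combinations


def minimal_guide_set(guide_targets, species):
    """Return a smallest list of guides whose targets cover all species.

    guide_targets: dict mapping a (hypothetical) guide name to the set of
                   species it can target.
    species:       iterable of species that must each be covered.
    Returns None when no combination of guides covers every species.
    """
    species = set(species)
    if not species:
        return []
    guides = sorted(guide_targets)
    # Try library sizes 1, 2, ... so the first hit is a minimum-size cover.
    for k in range(1, len(guides) + 1):
        for combo in combinations(guides, k):
            covered = set().union(*(guide_targets[g] for g in combo))
            if species <= covered:
                return list(combo)
    return None


# Hypothetical toy data: five candidate guides, four species.
guide_targets = {
    "g1": {"sp_A", "sp_B"},
    "g2": {"sp_B", "sp_C"},
    "g3": {"sp_C", "sp_D"},
    "g4": {"sp_A", "sp_D"},
    "g5": {"sp_B"},
}
species = ["sp_A", "sp_B", "sp_C", "sp_D"]
library = minimal_guide_set(guide_targets, species)
print(library)
```

On this toy input no single guide covers all four species, so the search returns a two-guide library. Exhaustive search grows exponentially with the number of candidate guides, which is why a formulation at the scale the abstract mentions (thousands of species) would need a dedicated solver rather than this enumeration.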
Kingdom-Wide CRISPR Guide Design with ALLEGRO

https://doi.org/10.1101/2025.02.13.638206

Mohseni, Amirsadra; Nia, Reyhane Ghorbani; Tafrishi, Aida; Liu, Xin-Zhan; Stajich, Jason E; Wheeldon, Ian; Lonardi, Stefano (February 2025, bioRxiv)

Abstract Designing CRISPR single guide RNA (sgRNA) libraries targeting entire kingdoms of life will significantly advance genetic research in diverse and underexplored taxa. Current sgRNA design tools are often species-specific and fail to scale to large, phylogenetically diverse datasets, limiting their applicability to comparative genomics, evolutionary studies, and biotechnology. Here, we present ALLEGRO, a combinatorial optimization algorithm able to design minimal, yet highly effective sgRNA libraries targeting thousands of species. Leveraging integer linear programming, ALLEGRO identified compact sgRNA sets simultaneously targeting several genes of interest for over 2,000 species across the fungal kingdom. We experimentally validated the sgRNAs designed by ALLEGRO in Kluyveromyces marxianus, Komagataella phaffii, and Yarrowia lipolytica. In addition, we adopted a generalized Cas9-Ribonucleoprotein delivery system coupled with protoplast transformation to extend ALLEGRO’s sgRNA libraries to other untested fungal genomes, such as Rhodotorula araucariae. Our experimental results, along with cross-validation, show that ALLEGRO enables efficient CRISPR genome editing, supporting the development of universal sgRNA libraries applicable to entire taxonomic groups.
Free, publicly-accessible full text available February 17, 2026
acCRISPR: an activity-correction method for improving the accuracy of CRISPR screens

https://doi.org/10.1038/s42003-023-04996-8

Ramesh, Adithya; Trivedi, Varun; Lee, Sangcheon; Tafrishi, Aida; Schwartz, Cory; Mohseni, Amirsadra; Li, Mengwan; Lonardi, Stefano; Wheeldon, Ian (June 2023, Communications Biology)

Abstract High throughput CRISPR screens are revolutionizing the way scientists unravel the genetic underpinnings of engineered and evolved phenotypes. One of the critical challenges in accurately assessing screening outcomes is accounting for the variability in sgRNA cutting efficiency. Poorly active guides targeting genes essential to screening conditions obscure the growth defects that are expected from disrupting them. Here, we develop acCRISPR, an end-to-end pipeline that identifies essential genes in pooled CRISPR screens using sgRNA read counts obtained from next-generation sequencing. acCRISPR uses experimentally determined cutting efficiencies for each guide in the library to provide an activity correction to the screening outcomes via calculation of an optimization metric, thus determining the fitness effect of disrupted genes. CRISPR-Cas9 and -Cas12a screens were carried out in the non-conventional oleaginous yeast Yarrowia lipolytica, and acCRISPR was used to determine a high-confidence set of essential genes for growth under glucose, a common carbon source used for the industrial production of oleochemicals. acCRISPR was also used in screens quantifying relative cellular fitness under high salt conditions to identify genes that were related to salt tolerance. Collectively, this work presents an experimental-computational framework for CRISPR-based functional genomics studies that may be expanded to other non-conventional organisms of interest.
Balanced Training Sets Improve Deep Learning-Based Prediction of CRISPR sgRNA Activity

https://doi.org/10.1021/acssynbio.4c00542

Trivedi, Varun; Mohseni, Amirsadra; Lonardi, Stefano; Wheeldon, Ian (November 2024, ACS Synthetic Biology)
Reference-agnostic representation and visualization of pan-genomes

https://doi.org/10.1186/s12859-021-04424-w

Liang, Qihua; Lonardi, Stefano (October 2021, BMC Bioinformatics)

Abstract Background: The pan-genome of a species is the union of the genes and non-coding sequences present in all individuals (cultivars, accessions, or strains) within that species. Results: Here we introduce PGV, a reference-agnostic representation of the pan-genome of a species based on the notion of consensus ordering. Our experimental results demonstrate that PGV enables an intuitive, effective and interactive visualization of a pan-genome by providing a genome browser that can elucidate complex structural genomic variations. Conclusions: The PGV software can be installed via conda or downloaded from https://github.com/ucrbioinfo/PGV. The companion PGV browser at http://pgv.cs.ucr.edu can be tested using example bed tracks available from the GitHub page.
Prediction of histone post-translational modifications using deep learning

https://doi.org/10.1093/bioinformatics/btaa1075

Baisya, Dipankar Ranjan; Lonardi, Stefano (December 2020, Bioinformatics)
Cowen, Lenore (Ed.)
Abstract Motivation: Histone post-translational modifications (PTMs) are involved in a variety of essential regulatory processes in the cell, including transcription control. Recent studies have shown that histone PTMs can be accurately predicted from the knowledge of transcription factor binding or DNase hypersensitivity data. Similarly, it has been shown that one can predict PTMs from the underlying DNA primary sequence. Results: In this study, we introduce a deep learning architecture called DeepPTM for predicting histone PTMs from transcription factor binding data and the primary DNA sequence. Extensive experimental results show that our deep learning model outperforms the prediction accuracy of the model proposed in Benveniste et al. (PNAS 2014) and DeepHistone (BMC Genomics 2019). The competitive advantage of our framework lies in the synergistic use of deep learning combined with an effective pre-processing step. Our classification framework has also enabled the discovery that the knowledge of a small subset of transcription factors (which are histone-PTM and cell-type-specific) can provide almost the same prediction accuracy that can be obtained using all the transcription factors data. Availability and implementation: https://github.com/dDipankar/DeepPTM. Supplementary information: Supplementary data are available at Bioinformatics online.
Full Text Available
DeeplyEssential: a deep neural network for predicting essential genes in microbes

https://doi.org/10.1186/s12859-020-03688-y

Hasan, Md Abid; Lonardi, Stefano (September 2020, BMC Bioinformatics)
Abstract Background: Essential genes are those genes that are critical for the survival of an organism. The prediction of essential genes in bacteria can provide targets for the design of novel antibiotic compounds or antimicrobial strategies. Results: We propose a deep neural network for predicting essential genes in microbes. Our architecture, called DeeplyEssential, makes minimal assumptions about the input data (i.e., it only uses gene primary sequence and the corresponding protein sequence) to carry out the prediction, thus maximizing its practical application compared to existing predictors that require structural or topological features which might not be readily available. We also expose and study a hidden performance bias that affected previous classifiers. Extensive results show that DeeplyEssential outperforms existing classifiers that either employ down-sampling to balance the training set or use clustering to exclude multiple copies of orthologous genes. Conclusion: Deep neural network architectures can efficiently predict whether a microbial gene is essential (or not) using only its sequence information.
Full Text Available
OMGS: Optical Map-Based Genome Scaffolding

https://doi.org/10.1089/cmb.2019.0310

Pan, Weihua; Jiang, Tao; Lonardi, Stefano (April 2020, Journal of Computational Biology)

Full Text Available
Selfish: discovery of differential chromatin interactions via a self-similarity measure

https://doi.org/10.1093/bioinformatics/btz362

Ardakany, Abbas Roayaei; Ay, Ferhat; Lonardi, Stefano (July 2019, Bioinformatics)

Abstract Motivation: High-throughput conformation capture experiments, such as Hi-C, provide genome-wide maps of chromatin interactions, enabling life scientists to investigate the role of the three-dimensional structure of genomes in gene regulation and other essential cellular functions. A fundamental problem in the analysis of Hi-C data is how to compare two contact maps derived from Hi-C experiments. Detecting similarities and differences between contact maps is critical in evaluating the reproducibility of replicate experiments and for identifying differential genomic regions with biological significance. Due to the complexity of chromatin conformations and the presence of technology-driven and sequence-specific biases, the comparative analysis of Hi-C data is analytically and computationally challenging. Results: We present a novel method called Selfish for the comparative analysis of Hi-C data that takes advantage of the structural self-similarity in contact maps. We define a novel self-similarity measure to design algorithms for (i) measuring reproducibility for Hi-C replicate experiments and (ii) finding differential chromatin interactions between two contact maps. Extensive experimental results on simulated and real data show that Selfish is more accurate and robust than state-of-the-art methods. Availability and implementation: https://github.com/ucrbioinfo/Selfish
Accurate detection of chimeric contigs via Bionano optical maps

https://doi.org/10.1093/bioinformatics/bty850

Pan, Weihua; Lonardi, Stefano (October 2018, Bioinformatics)
Berger, Bonnie (Ed.)

Abstract Summary: A chimeric contig is a contig that has been incorrectly assembled, i.e. a contig that contains one or more mis-joins. The detection of chimeric contigs can be carried out either by aligning assembled contigs to genome-wide maps (e.g. genetic, physical or optical maps) or by mapping sequenced reads to the assembled contigs. Here, we introduce a software tool called Chimericognizer that takes advantage of one or more Bionano Genomics optical maps to accurately detect and correct chimeric contigs. Experimental results show that Chimericognizer is very accurate, and significantly better than the chimeric detection method offered by the Bionano Hybrid Scaffold pipeline. Chimericognizer can also detect and correct chimeric optical molecules. Availability and implementation: https://github.com/ucrbioinfo/Chimericognizer Supplementary information: Supplementary data are available at Bioinformatics online.